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Abstract 

A statistical method which uses a combination of two subdetectors to 
monitor the luminosity in high energy interactions is presented. To illus- 
trate its performance, this method was applied to random triggered min- 
imum bias data collected in the commissioning period of the HERA-B 
experiment in spring 2000. It is found that luminosity estimates with 
an intrinsic systematic error of 3% can be obtained. 

1 Introduction 

The precise determination of the luminosity of experimental data is required 
for absolute cross section measurements. Luminosity I is defined as the pro- 
portionality factor between interaction rate and cross section for the process 
under consideration. The integrated luminosity L relates cross section a and 
interaction count N for a time interval T, i.e. 

l(t) = -^L and thus L = I l(t)dt = — . (1) 
a at J a 

Given the cross section a for a particular process, such as the inelastic 
cross section in high energy hadronic interactions, the determination of the 
integrated luminosity for a given data set is equivalent to determining the 
number of interactions of that process. For the following we will focus on col- 
lider experiments, where a bunched beam produces events with a well defined 
time structure, and where the number of interactions per bunch crossing will 
fluctuate statistically. 

As a first approach, determining the number of interactions could be accom- 
plished by simply counting the number of reconstructed primary vertices in the 
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data. To achieve that, the vertex reconstruction efficiencies must be known. 
Additionally, two neighboring vertices may be merged by the reconstruction 
package while others may be split. Therefore, the probabilities for these pro- 
cesses must also be calculated, which is often difficult and introduces poorly 
known systematic errors. An alternative technique consists in extracting the 
total number of interactions from inclusive quantities in the reconstructed data 
which are proportional to the number of primary collisions in an event, such 
as the number of hits or the total energy deposition. Obviously, the main 
difficulty associated with this approach is the need for an absolute calibration, 
that is, the average signal for a single interaction must be known. 

In a different approach (the so called "statistical method"), a poissonian 
distribution for the number of interactions per bunch crossing is assumed and 
the average number of interactions is extracted from the number of empty 
events in the data sample. The advantage of this method is that nothing 
about the average signal for a single interaction has to be known or assumed. 
However, the acceptances for tagging non-empty events must be estimated and 
the occurrence of noise events, which may be tagged as non-empty, must be 
taken into account. 

In this paper a method for determining the integrated luminosity by count- 
ing the fraction of empty events simultaneously in two subdetectors is pro- 
posed. With this procedure the detector acceptances for a single interaction 
and the fraction of noise events can in principle be obtained from data, relax- 
ing the dependence on Monte Carlo simulations to derive these quantities and 
the introduction of systematic errors which are difficult to estimate. 

This paper is organized as follows. In the next section, an expression for the 
probability to observe an empty event is derived, assuming that the distribu- 
tion of the number of interactions per bunch crossing follows Poisson statistics 
but allowing also for non-negligible rate fluctuations. Based on this, in Section 
3, the Two-System Statistical Method (TSSM) is introduced. It is shown how 
counting the fraction of empty events in either of two subdetectors and si- 
multaneously in both allows to determine the acceptance of both subdetectors 
and the mean number of interactions in the data. In Section 4, this procedure 
is applied to minimum bias events collected in the commissioning period of 
the HERA-B experiment in spring 2000. The conclusions are presented in 
Section 5. 

2 Counting Empty Events 

A particle collider usually has circulating beams with many bunches contribut- 
ing to the observed interaction rate. Since the individual bunch currents, in 
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general, can differ by significant amounts, the following analysis is formulated 
for an ensemble of distinguishable bunches. This entails a slight complication 
of the formalism, but, as will become clear later, gains a lot of information 
which can be exploited in the analysis. 

For the start let us assume that the distribution of the number of interac- 
tions per bunch crossing follows Poisson statistics. If the average number of 
interaction produced by bunch number i is the probability that there are 
n interactions in an event from this bunch crossing is 

V(n,i) = ^e-Vi. ( 2 ) 

Now suppose that a certain subdetector is used to count the number of empty 
events in the data set. An event is tagged as being empty if a quantity associ- 
ated with this subdetector (e.g. hits, tracks, energy deposition, etc.) is below 
a specified threshold value. This value represents a compromise between a 
large efficiency for tagging non-empty interactions and an effective exclusion 
of noisy "events" in which no interaction has occurred. The probability to 
observe an empty event in this system is 

oo 

P(0,i) = (l-g)^(l-aW)7>(7M), (3) 

n=0 

where <zM is the acceptance, or efficiency, to tag an event with n interactions 
as non-empty and q is the probability to observe an event due to noise in the 
subdetector or to background (i.e. beam gas interactions). If the probability 
to pass the tagging threshold is independent of the number of primary inter- 
actions, which to a good approximation is valid if the threshold is set such 
that a single interaction has a large probability to exceed it, then can be 
approximated by 

a (n) = 1 - (1 - a) n (4) 

where a = at 1 ' is the efficiency to tag a single interaction as non-empty. Sub- 
stituting Eq. (J2J) and (j3J) into Eq. (j2J) one gets 

P(0,i) = (1 -q)e~ a ^. (5) 

However, some bunches may suffer from rate instabilities so that Eq. (J2J) 
does no longer describe the interaction multiplicities correctly. In this case, 
the average number of interactions fii is no longer constant but it fluctuates 
by a random amount v\ around its central value, \ii — > fa + Vi. With g{vi) the 
probability density function of those fluctuations, the probability to observe 
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an empty event becomes 

P(0, z) = (1 - q) ^ e~ a ^ + ^g^dUi (6) 

Assuming further that the fluctuations around fii are Gaussian distributed, 
with zero average {u)i = and standard deviation tjj <C //j, Eq. (JHJ) can be 
integrated analytically to yield 

P(0,i) = (l-g)expf-a^ + ~aV?J . (7) 

One sees that rate fluctuations enter as second order effects, i.e. as long as 
they are small the assumption of poissonian distribution for interaction mul- 
tiplicities is a good approximation. Large rate fluctuations, however, have a 
sizeable impact and have to be taken into account in the analysis. 



3 The Two-System Statistical Method 

Now let us consider two subdetectors or combinations of subdetectors, which 
will be denoted by "system 1" and "system 2". According to Eq. (JJJ) the 
probabilities p k , k = {1,2} to observe an empty event in either of the two 

systems are 

p ki = P(0, i) k = (1 - q k ) exp (-a h fM + J ( 8 ) 

where q k is the probability to record an event due to background or noise in 
system k and a k is the efficiency to tag single interactions in this system. If 
the two systems are independent the probability po to observe an empty event 
simultaneously in both subdetectors is given by an analogous expression 

Poi = (1 - <7o) exp (^-a fii + ^l a i) > ( 9 ) 

where q = c?i + g 2 - <?i<?2 and a = a x + a 2 - a x a 2 . 

In order to get a handle on rate fluctuations, we now combine the statistical 
approach with a measurement based on an inclusive quantity, which is insen- 
sitive to deviations from a poissonian for the interaction multiplicities. This 
is achieved by expressing //, in terms of a bunch dependent inclusive quantity 
(n)i which is proportional to the number of interactions per bunch crossing, 

(n)i = Tfii. (10) 
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The parameter r is the mean value of the inclusive quantity per interaction 
within the detector acceptance. Substituting Eq. (JTUJl in Eq. (jHJ) and we 

have 

Phi = (1 - q k ) exp (-CLk^ 1 + > with fc={0,l,2}. (11) 

If there are no rate fluctuations, Oi ~ 0, Vz, the global (bunch independent) 
parameters, qk, a k and r, can be obtained from Eq. (fTTj) by fitting the values of 

as a function of the observable (n)j. Once r is known, the average number 
of interactions fa for every bunch is calculated according to Eq. (fTUj) . From fa 
and the number of recorded events in each of the bunches, the total number 
of interactions in the data sample and thus the integrated luminosity can be 
calculated. 

In case that rate fluctuations are present for some bunches, those bunches 
have to be identified and removed from the global fit. This can be achieved by 
considering the following relation between the probabilities p k i, 



n POi 

In = ai<2 2 

P\iP2i 



(n)i 2 
' + 07 I 1 - a - -aia 2 



t V u 2 



(12) 



In case of negligible rate fluctuations we have a simple linear relation between 
\^{poi/piiP2i) and (n)i. Bunches with significant rate fluctuations would deviate 
from that relation and can be excluded from the global fit. Note that Eq. ()12j) 
also has the potential to detect situations where all bunches are subject to rate 
fluctuations. In this case one has no outlier bunches, but a straight line fit to 
ln.(poi/piiPii) versus (n)i would not pass through the origin, unless of and fa 
are proportional. 



4 Application to HERA-B Minimum Bias Data 

To illustrate its properties, the proposed method was applied to minimum bias 
events, collected with a simple random trigger during the HERA-B commis- 
sioning period in spring 2000. Applying the TSSM to real data shows how the 
considerations that went into its design cope with problems arising under real- 
istic conditions. With respect to HERA-B, please note that the random-trigger 
based method described in this paper should not be confused with other ones 
employed by the HERA-B collaboration for luminosity measurements, such as 
for example the method S2J applied to the interaction-triggered data recorded 
in 2002/2003. 

HERA-B is a large acceptance fixed target experiment that studies the 
interactions of 920 GeV protons with wire targets placed in the beam halo 
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of the HERA storage ring, at DESY. The HERA-B target [3] consists of two 
stations separated by 4 cm along the beam. Each station comprises four wires 
of different materials, with dimensions ranging from 0.5 to 1 mm along the 
beam and from 50 to 100 /iin perpendicular to the beam. Each wire can be 
independently moved inside the beam halo in order to adjust the interaction 
rate. The reconstruction of primary and secondary vertices is performed by 
a silicon micro-strip Vertex Detector System jlj. The main tracker is divided 
into the Inner Tracker jH], composed of micro-strip gas chambers with gas 
electron multipliers, and the Outer Tracker [Hj made of honeycomb drift cells. 
Particle identification is performed by a ring imaging Cherenkov detector [Jj, 
an electromagnetic calorimeter jH] and a muon detector 0. 

The runs analysed in the following were taken with four different target 
materials: carbon, aluminum, titanium and tungsten. The nominal interaction 
rates ranged from 2 to 20 MHz. In the HERA proton ring there are 220 slots for 
bunches separated by 96 ns. Usually only 180 of these are filled with protons. 
These are organized in three groups of 60 bunches, separated by three gaps of 
5+5+15 empty slots. In turn, each group is composed of 6 subgroups of 10 
contiguous bunches. These subgroups are separated by a single empty slot. 

In Fig. ^ one can see the number of recorded events as a functions of the 
bunch number for a run of 500k events taken with a carbon wire target. As 
can be seen, the data acquisition system samples all bunches very uniformly 
(even the ones which are nominally empty). 
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Figure 1: Number of events recorded with a random trkre function of 

bunch number. Bunches that are nominally empty are also sampled. 

At HERA-B inelastic interactions dominate the total visible cross section 
and therefore they are the natural reference process for luminosity determina- 
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tion. Elastic scattering events are normally outside the detectors acceptances 
and contribute marginally to the rate. The inelastic cross section of pA col- 
lisions was measured by several fixed target experiments, for a large number 
of target materials and beam energies jH3 HU H21 EH UH HHj- It is found to 
be approximately independent of the incident particle energy and a power law 
dependence on the target atomic weight A is well fitted by the experimental 
data • The inelastic cross section comprises a non- diffract ive and a diffrac- 
tive component. Since for the latter both experimental acceptance and the 
contribution to the total cross section are small, it is a good approximation 
to assume that only the non- diffract ive component of the inelastic cross sec- 
tion contributes to the luminosity determination. The resulting bias can be 
estimated by Monte Carlo simulations. 

4.1 Mean number of tracks per bunch crossing 

In Eq. (fTUj) Hi was expressed in terms of an inclusive quantity which is pro- 
portional to the number of interactions per bunch crossing. In the following 
we choose this quantity to be the mean number of reconstructed tracks (n t ) 
which, to a good approximation, scales linearly with the number of primary 
collisions, i.e. (nt) = 77^, where r is the mean number of reconstructed tracks 
in one interaction. The validity of this assumption was checked with a Monte 
Carlo simulation based on the FRITIOF 7.02 generator J7j and the subse- 
quent simulation of the HERA-B detector. In order to exclusively select tracks 
originating from primary interactions, and eliminate non-target related tracks 
from secondary decays such as K$ — > 7r + 7r~ and conversions 7 — > e + e~, the 
following selection criteria were applied to all tracks in the event. Only tracks 
containing at least 6 reconstructed hits in the vertex detector (VDS) 
cepted. To avoid counting multiply reconstructed tracks (the so-called clones), 
tracks sharing a VDS segment with a previously accepted track were rejected. 
Finally, an impact parameter below 1 mm at the primary vertex is required. 

The plot on the left of Fig. El shows a Monte Carlo simulation for the mean 
number of reconstructed tracks (n t ) n which satisfy these criteria as a function 
of the number n of superimposed interactions. One sees that (n t ) n indeed 
scales linearly with interaction rate up to 4 superimposed interactions, which 
corresponds to a rate of about 40 MHz. Furthermore, heavier target materials 
yield higher track multiplicities. The plot on the right of Fig. El shows (n t )i as 
a function of the bunch number % for the same run considered in Fig. ^ There 
are remarkable variations in the track multiplicities between different bunches, 
which clearly indicates distinct contributions to the total rate. In this plot, we 
can also identify the bunches which are nominally empty and contribute only 
marginally to the rate. 
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Figure 2: Average number of reconstructed tracks as a function of the number 
of superimposed primary interactions, given by a Monte Carlo simulation (left); 
Average number of reconstructed tracks as a function of the bunch number for 
a run taken with the carbon wire (right). 

4.2 Defining the systems 

In principle, any subdetector or combination of subdetectors in the experiment 
can be chosen as a system for counting empty events. In order to minimize the 
dependence on Monte Carlo simulations, the requirement is a large acceptance 
for tagging non-empty events, reasonably low noise levels and good stability 
with time. We have chosen the most stable subdetectors in the data taking 
period of year 2000. System 1 consists of the vertex detector (VDS). An event 
is not empty in this system if: 

• there is at least 1 reconstructed track satisfying the track selection cri- 
teria explained above. 

System 2 is a combination of the ring imaging Cherenkov counter (RICH) 
and the electromagnetic calorimeter (ECAL). An event is considered to be not 
empty in this system if the following conditions are both fulfilled: 

• there are at least 30 reconstructed hits in the RICH. 

• the deposited energy in the inner part of the ECAL is above 5 GeV. 

In Fig. El we can see the distributions of number of tracks satisfying the 
track selection criteria (a), number of hits in the Cherenkov detector (b) and 
the total energy deposition in the inner part of the electromagnetic calorimeter 
(c) for a run taken with a carbon target wire. 
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Figure 3: Distributions of (a) number of tracks satisfying the track selection 
criteria, (b) number of hits in the RICH, and (c) energy deposition in inner 
ECAL, for a run taken with the carbon wire. The inserts are a zoom to the 
first bins for each distribution. 



In Fig. |U we can find the probabilities pki, k = {0,1,2}, estimated as 
the fraction of empty events in system 1 (a), the fraction of empty events in 
system 2 (b) and the fraction of empty events in both systems (c), for the 
180 nominally filled bunches. Again, remarkable variations are found between 
bunches, indicating different contributions to the total interaction rate. 
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Figure 4: Fraction of empty events as a function of bunch number in (a) system 
1; (b) system 2; and (c) both systems. 

Figure El shows ln(l/pi), ln(l/p 2 ) and ln(p /piP2) as a function of (n t ) for 
two different runs. Each entry corresponds to one bunch. The top row is 
for a run taken with the carbon wire and very small rate fluctuations. The 
bottom row corresponds to a run taken with an aluminum wire and large 
rate fluctuations. The global parameters are obtained from an unweighted 
linear fit, performed after the bunches subject to rate fluctuations have been 
removed from the fit. These bunches are identified according to the constraint 
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on the probabilities pki given by Eq. (j!2j) . In Fig. El one can see the ratio 
ln(p / 'P1P2) I '( n t) j which should be approximately constant for negligible rate 
fluctuations. For the run taken with the carbon target wire (left plot) this ratio 
is reasonably constant, indicating the absence of significant rate fluctuations. 
On the other hand, for the run taken with the aluminum target wire there are 
bunches which are subject to deviations from Poisson statistics which can be 
identified by having a lower than average value for ln(po/PiP2)/(^t)- Notice 
that these bunches can also be identified in Fig.EJT) below the main line (where 
entries concentrate), since for a given (n t ) they will have a lower ln(po/piP2)- 
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Figure 5: Plots of ln(l/pi), ln(l/p 2 ) and \n(p /pip 2 ) as a function of {n t ). Each 
dot represents a bunch. The plots in the top row refer to a run taken with 
carbon target; the plots below refer to a run taken with aluminum target. The 
global parameters are obtained from a linear fit. 

In Table 1 we give the average values over all runs of the efficiencies ai t 2, 
the noise-probabilities qi^ and the average number of tracks per interaction 
r, obtained from the global fits to the selected runs. It can be seen that the 
efficiencies for system 1 are typically larger than for system 2. On the other 
hand, the efficiencies are, within errors, quite similar for all target materials. 
However, we could expect them to be larger for heavier materials, which yield 
higher track multiplicities. The probabilities are similar for runs acquired 
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Figure 6: Values of ln(po/piP2)/(wt) as a function of bunch number for a run 
taken with the carbon wire (left) and the aluminum wire (right). 



a i 



(i-2 



qi 



92 



T 



C 0.95 ±0.02 

Al 0.93 ±0.02 

Ti 0.97 ±0.02 

W 0.96 ±0.06 



0.86 ±0.02 
0.83 ±0.02 
0.86 ±0.02 
0.87 ±0.05 



0.0189 ±0.0003 
0.0516 ±0.0008 
0.0212 ±0.0003 
0.0177 ±0.0002 



0.0116 ±0.0001 
0.0535 ±0.0011 
0.0142 ±0.0002 
0.0126 ±0.0001 



7.69 ±0.26 
8.27 ±0.24 
9.95 ±0.25 
13.23 ±0.99 



Table 1: Bunch independent variables obtained from global fits to nominally 
filled bunches. 



with carbon, titanium and tungsten targets, but larger for runs acquired with 
the aluminum target. This fact may be explained by the large fraction of 
coasting beam (unbunched protons uniformly distributed under the pulsed 
bunch structure) which plagues all runs taken with this wire ^H|- Furthermore, 
because the runs taken with the aluminum wire target tend to show large rate 
instabilities it is natural to speculate if these are related to the presence of 
coasting beam. 

The mean number of tracks per interaction r increases, as expected, with 
the atomic weight of the target material. This dependence is usually param- 
eterized by a power law of the atomic weight: r oc If we fit the values 
of function of the target atomic weight A, we obtain (3 = 0.20 ± 0.02, 

which is statistically compatible with the result (3 = 0.18 ± 0.02 obtained in 
an independent study employing the HERA-B vertex detector [TH] . 

Once r is known, the average number of interactions per bunch crossing 
Hi can be calculated according to Eq. (JTUJ). Figure shows the values of ^ 
for all bunches, for a run taken with the carbon target wire (a) and with the 
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aluminum target wire (b). First, it is noteworthy that bunches contribute 
quite differently to the rate. In the run taken with aluminum target we can 
see a large contribution of nominally empty bunches to the total rate. This 
behaviour can be observed in other runs taken with aluminum wire and, again, 
this can be explained by the large fraction of coasting beam which is present 
in all runs taken with this wire. 
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Figure 7: Mean number of interactions as a function of bunch number for a 
run taken (a) with a carbon target wire, and (b) with an aluminum target 
wire. In plot (a) it can be seen the nominally empty bunches which contribute 
marginally to the interaction rate. In plot (b) these bunches contribute sig- 
nificantly to the rate, which is a clear indication of the high levels of coasting 
beam affecting this run. 

The total number of interactions N int in a run is given by N int = Y^i=i -^A^> 
where Ni is the total number of events due to bunch i. From Ni nt the luminosity 
is obtained using Eq. (JTJ) and the inelastic cross sections published in Ref. |TKj . 



4.3 Systematic uncertainties 

Because the final states of proton-nucleus interactions sample a large phase- 
space, certain event topologies may be outside the acceptance of both subde- 
tectors, leading to systematic uncertainties in the measured luminosity. Events 
which are not seen by both systems do not contribute to any inefficiency as 
inferred by the TSSM, and thus lead to an overestimate of the true acceptance. 
The systematic uncertainties of the statistical method were studied with a toy 
Monte Carlo based on the interaction model MINT [20] arid a coarse simula- 
tion of the HERA-B detector based on angular acceptance cuts, some rough 
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estimates for the track finding efficiencies plus some assumption about noise 
and smearing in the RICH and ECAL. The impact on the measured lumi- 
nosity of diffractive contributions, rate fluctuations by ±20%, target materials 
covering the range from Carbon to Tungsten and nominal interaction rates 
varying by a factor ±e were considered. It is found that for a detector such 
as HERA-B, there is a small bias on the luminosity estimate from the TSSM. 
Assuming that the reference cross section is the total inelastic cross section, 
the luminosity estimate is between 3% and 6% too small. Using instead only 
the non- diffractive inelastic cross section as a reference, the results are between 
1% and 6% too high. Taking conservatively the larger of the two ranges and 
correcting for the average bias, we conclude that the intrinsic systematic error 
of the TSSM is around 3%. Note that this figure does not include systematic 
uncertainties due to imperfect knowledge of the contributing cross sections. 

5 Conclusions 

A statistical method to measure the integrated luminosity of high energy in- 
teractions at collider experiments was presented. The method starts from the 
assumption that the number of interactions in a random triggered event follows 
Poisson statistics. Then, two large acceptance subdetectors of the experiment 
are considered. Counting the fraction of empty events in either of the two 
subdetectors and simultaneously in both, as function of the bunch crossing 
numbers, allows to infer the acceptance of the two subdetectors, noise contri- 
butions and total number of interactions from the data alone, thereby reducing 
the dependence of the analysis on Monte Carlo simulations. Introducing also 
information from an inclusive quantity, the method was implemented such that 
a bias due to rate fluctuations, which tend to spoil the assumption of Pois- 
son statistics for the interaction multiplicity of a given bunch, can be avoided. 
This method was applied to random triggered minimum bias data collected in 
the commissioning period of the HERA-B experiment in spring 2000. Without 
correcting the luminosity estimates for the bias caused by those parts of the 
cross section which are not seen by either of the two sub-systems considered, 
the TSSM would have an intrinsic systematic error of 6%. For more hermetic 
detectors and at higher energies even smaller uncertainties can be expected. 
Correcting for the bias, the intrinsic systematic error of the method drops to 
3%. 
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